首页> 外文OA文献 >Combining vocal tract length normalization with hierarchial linear transformations
【2h】

Combining vocal tract length normalization with hierarchial linear transformations

机译:将声道长度归一化与分层线性变换相结合

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Recent research has demonstrated the effectiveness of vocal tract length normalization (VTLN) as a rapid adaptation technique for statistical parametric speech synthesis. VTLN produces speech with naturalness preferable to that of MLLR-based adaptation techniques, being much closer in quality to that generated by the original average voice model. However with only a single parameter, VTLN captures very few speaker specific characteristics when compared to linear transform based adaptation techniques. This paper proposes that the merits of VTLN can be combined with those of linear transform based adaptation in a hierarchial Bayesian framework, where VTLN is used as the prior information. A novel technique for propagating the gender information from the VTLN prior through constrained structural maximum a posteriori linear regression (CSMAPLR) adaptation is presented. Experiments show that the resulting transformation has improved speech quality with better naturalness, intelligibility and improved speaker similarity.
机译:最近的研究表明,声道长度归一化(VTLN)作为统计参数语音合成的快速自适应技术的有效性。 VTLN产生的语音自然度优于基于MLLR的自适应技术,其质量与原始平均语音模型产生的语音质量非常接近。但是,与基于线性变换的自适应技术相比,VTLN仅具有单个参数,就只能捕获很少的扬声器特定特性。本文提出在层级贝叶斯框架中可以将VTLN的优点与基于线性变换的适应性相结合,其中VTLN被用作先验信息。提出了一种新技术,用于通过约束结构最大值后验线性回归(CSMAPLR)适应从VTLN传播性别信息。实验表明,由此产生的转换改善了语音质量,具有更好的自然性,清晰度和说话者相似度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号